CASA-Crowd: a Context-Aware Scale Aggregation CNN-based crowd counting technique

Ilyas, Naveed; Ahmad, Ashfaq; Kim, Kiseon

Title: CASA-Crowd: a Context-Aware Scale Aggregation CNN-based crowd counting technique
Creator: Ilyas, Naveed; Ahmad, Ashfaq; Kim, Kiseon
Relation: IEEE Access Vol. 7, p. 182050-182059
Publisher Link: http://dx.doi.org/10.1109/ACCESS.2019.2960292
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Resource Type: journal article
Date: 2019
Description: The accuracy of object-based computer vision techniques declines due to major challenges originating from large scale variation, varying shape, perspective variation, and lack of side information. To handle these challenges most of the crowd counting methods use multi-columns (restrict themselves to a set of specific density scenes), deploying a deeper and multi-networks for density estimation. However, these techniques suffer a lot of drawbacks such as extraction of identical features from multi-column, computationally complex architecture, overestimate the density estimation in sparse areas, underestimating in dense areas and averaging of feature maps result in reduced quality of density map. To overcome these drawbacks and to provide a state-of-the-art counting accuracy with comparable computational cost, we therefore propose a deeper and wider network: a Context-aware Scale Aggregation CNN-based Crowd Counting method (CASA-Crowd) to obtain the deep, varying scale and perspective varying features. Further, we include a dilated convolution with varying filter size to obtain contextual information. In addition, due to different dilation rates, a variation in receptive field size is more useful to overcome the perspective distortion. The quality of density map is enhanced while preserving the spatial dimension by obtaining a comparable computational complexity. We further evaluate our method on three well-known datasets: UCF_CC_50, ShanghaiTech Part_A, ShanghaiTech Part_B.rable computational cost, we therefore propose a deeper and wider network: a Context-aware Scale Aggregation CNN-based Crowd Counting method (CASA-Crowd) to obtain the deep, varying scale and perspective varying features. Further, we include a dilated convolution with varying filter size to obtain contextual information. In addition, due to different dilation rates, a variation in receptive field size is more useful to overcome the perspective distortion. The quality of density map is enhanced while preserving the spatial dimension by obtaining a comparable computational complexity. We further evaluate our method on three well-known datasets: UCF-CC-50, ShanghaiTech Part-A, ShanghaiTech Part-B.
Subject: deep learning; convolutional neural networks; densisty estimation; crowd counting
Identifier: http://hdl.handle.net/1959.13/1416576
Identifier: uon:37076
Identifier: ISSN:2169-3536
Rights: This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see http://creativecommons.org/licenses/by/4.0/
Language: eng
Full Text
Reviewed

Hits: 1883
Visitors: 2044
Downloads: 195

		Thumbnail	File	Description	Size	Format
View Details Download			ATTACHMENT02	Publisher version (open access)	3 MB	Adobe Acrobat PDF	View Details Download